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1. Introduction 

It is a well-known challenge to include the fermion determinant into the Boltzmann factor of 
the desired ensemble. Many applications, like algorithms with Metropolis acceptance-rejection 
steps or reweighting methods, require the ratio of such determinants. The main problem are the 
fluctuations of the ratio due to the stochastic and the ensemble noise. In order to use and to improve 
such methods it is essential to understand these fluctuations. In [p we presented an algorithm 
where we use the knowledge of these fluctuations to establish the Partial-Stochastic-Multi-Step- 
algorithm which reaches a high acceptance rate of 60% up to moderate lattice sizes of (1.2fm) 4 . 
With this experience and some techniques we used, we study here the fluctuations in the case of 
mass reweighting. 

In these proceedings we will analyze the scaling of mass reweighting [||] by factorizing the 
fluctuations into UV- and IR-dominated terms. This is done by using domain decomposition [0]. 
The determinant of the (Wilson-)Dirac operator is then detD = detDdetD wn ,detD/^, where the 
Schur complement is given by D = 1 — D^Dj^D^D^ with the Dirac operator in block notation 



D 



Dbb Db w 



and accordingly D 1 



%b %w 
&wb &ww 



(1.1) 



The operator Dbb (D ww ) is a block-diagonal matrix with the black (white) block Dirac operators on 
the diagonal. The Schur complement can be restricted to the support of D w b (using the projector 
P defined by D w bP = D w b) without changing its determinant and its inverse is then of the form 
D 1 = l-P% w D wb . 

2. Two Flavor Mass Reweighting 

The idea of mass reweighting is to reuse an ensemble which is generated at a specific mass 
mi (the ensemble mass) at a different mass mi (the target mass). This is possible by correcting 
the Boltzmann factor of the ensemble The correction for a configuration U enters as the 
reweighting factor W(U,mi,m2) which is given by 

m , r , \ det D(U,m 2 ) N f 1 

W(U,m 1 ,m 2 ) = ) ' = tt (2.1) 

with D(U,m) the (Wilson-)Dirac operator, Nf the number of flavors (=2) and the ratio matrix 
M = D 1 (U,m2)D(U,mi). The reweighting factor introduces additional noise in the evaluation of 
observables (0) m , = -^Sy^ 1 , the ensemble fluctuations. One can avoid the exact evaluation of the 
determinant by an unbiased stochastic estimation of the integral 



1 



t 1 Nh " t i 

- = y Dfo]Dfe Vpf-^MT!} "^-D* (2.2) 

where T],- are complex Gaussian noise vectors, Nhu is the number of the estimates and one estima- 
tion costs one inversion of the Dirac operator. This estimation introduces stochastic fluctuations 
which are negligible if and only if the ensemble fluctuations dominate the statistical error of the 
measurement. We analyze these fluctuations by reweighting two Nf = 2 CLS-ensembles (0(a) 
impr. Wilson fermions) of two different sizes 48 x 24 3 and 64 x 32 3 at /3 = 5.3 (a = 0.066 fm) 
from the pseudoscalar mass of mps = 440 MeV to the target mass of mp$ = 310 MeV [@]. 
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2.1 Stochastic Fluctuations 

The variance for Nun = 1 of the stochastic estimation is given by [||] 

2_ 1 1 

° s ~ det(2AftM- 1) ~ (detMtM) 2 ' ( ' ) 

It follows that the variance is only defined if all eigenvalues of the ratio matrix M ' M are larger than 
1/2 and that every eigenvalue which is equal to one produces no stochastic noise. So every method 
which shifts the eigenvalues of the ratio matrix to one improves the estimation. We will shortly 
motivate and present two different methods which fulfill this condition, mass interpolation [Q] and 
domain decomposition |^. 

It is obvious that the mass reweighting factor W(Ui,m\,m2) is known if the spectrum of the 
Wilson-Dirac operator D(Uj,m 2 ) is known 

detM-' = fl w A f ( :; 2)) (2-4) 

where Am = m\ — ni2- The product is dominated by the IR-modes. The eigenvalues of the ratio 
matrix M = 1 + Am ■ D 1 (m-i) are given by 

X{M) = 1 +Am-A(D~ 1 (wi 2 )). (2.5) 

For the case that there is no negative eigenvalue it follows that X(M T M) > 1 (for Am > 0). So if 
we use the Wilson-Dirac operator each eigenvalue produces stochastic noise in particular also the 



UV-modes. If we use the Schur complement, the operator D~ l (ni2) in Eq. (2.5) is replaced by 
an operator [S>bw{ m 2] — @ww ( m 2)D^ v (mi )D w b]D^ b (mi )Dt, w which could have eigenvalues with a 
negative or vanishing real part. With the Schur complement the ratio matrix has a spectrum which 
is distributed around one. 



In addition Eq. ( |2.5[ ) implies that a smaller mass-shift would shift the eigenvalues closer to 
one. This is easily achieved if one introduces an interpolation in the mass and by splitting up 
the ratio matrix in several ratio matrices D(m\) /D(m2) = {D(m\) / D(m,-)} {D(/n,-) / D(m2)} ■ This 
technique works only if no real eigenvalue of the Wilson-Dirac operator becomes negative. In this 
case the ratio matrix gets eigenvalues which are smaller than 1/2 and the stochastic estimation fails. 
If this happens one has to use additional methods, like exact eigenvalue calculation, to calculate the 
reweighting factor in an appropriate way. The estimation of the reweighting factor with the domain 
decomposition is now given by 

^ det D?(m 2 ) 

W = W g rY\ tt^t (2-6) 

where k labels the white and black blocks. For moderate block sizes Z 4 < 6 4 the exact calculation 
of the block determinants is feasible while the global factor W g i is estimated by using N mass 
interpolation steps and N%u estimations of each ratio 

N ( i Nut t , , t , . 1 

„, ; n ' ^„ " ••<> (2.7) 

i=l I "hit /=1 
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Figure 1: The figures show the stochastic fluctuations in the case of two flavor mass reweighting by using 
mass interpolation and domain decomposition for one configuration of the 48 x 24 3 ensemble. In the left 
figure we analyze the scaling in the number of mass interpolation steps N while the total number of global 
inversions N • is fixed to 640. We plot the estimated relative stochastic variance a 2 (W)/W 2 against the 
number of interpolation steps N. The analysis shows that for N > 8 it makes no difference if one increases 
Nf,it or N. If the eigenvalues of Ml Mi are close enough to one a 2 (W)/W 2 scales with 1/AW/,,,. We fit the 
asymtotic plateau for the total operator (star,blue), the even-odd preconditioned operator (equivalent to a 
Schur complement with blocks of length 1=1) (diamonds, magenta), the Schur complement with 6 4 -blocks 
(circle, red) and with 12 4 blocks (square,black). The right figure shows the results of the plateau fit against 
the total dimension of the operators divided by the dimension of the global Wilson-Dirac operator. For the 
Schur complement we only take the dimension of the projector P into account. 



where the z'th ratio matrix is given by M,- = D _1 (mj)i)(mj_i) with the Schur complement D(m ; ) 
depending on the z'th mass m, = i/N • ni2 + (N — i)/N • m\ . Inverting the Schur complement costs 
one inversion of the Dirac operator. 

In practice it is now easy to control the stochastic fluctuations by changing the number of 
inversions N -Nhu, this is possible as long as there is no zero-crossing of the eigenvalues of D{m). 
To avoid a wrong estimation (zero-crossings for m\ > mi) it is necessary to control the variance of 



each factor in ( |2.7[ ), which can be estimated by setting Nhu > 6. Increasing N or Nhu is comparable, 
if the eigenvalue distribution of the ratio matrix M + M is close to one, which can be achieved by 
increasing Af to sufficient value (see Fig. (|I])). 

Fig. ([[]) also shows the effect of using the Schur complement instead of the total Dirac operator. 
For block sizes / > 1 it is two times more efficient to use the Schur complement. Another conclusion 
is that stochastic fluctuations do not scale with the dimension of the operator. It is obvious that the 
remaining IR-modes dominate the fluctuations. In general we find that the stochastic fluctuations 
scale with Am 2 V / (N ■ N hit ) . 



2.2 Ensemble Fluctuations 

The ensemble fluctuations enter the game if one wants to calculate an observable, which is 



given for the target mass by (0),„ 2 
form MM 



(ow)„ M 



The total variance of such an observable gets the 
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Figure 2: The figures show the scaling of (J 2 (lnW) with the volume V and the mass shift Am by using 
60-100 configurations. The left figure shows the fluctuations of the global factor c 2 (lnW) multiplied by 
1 /VAm 2 against several mass shifts Am while on the left side we plot the small volume and on the right side 
the bigger one (we write the parameters for the bigger in brackets). The ensemble noise is estimated with 
Nhu — 6, /V = 16, (48) and domain decomposition with 6 4 (8 4 ) blocks. The 8 4 blocks are decomposed further 
in a 8 4 Schur complement with Dirichlet boundaries and 4 4 blocks. The right figure show the fluctuations 
for the global Schur complement multiplied by 1 / \/V. Am 2 against several mass shits and different Schur 
complements. The constant fit illustrate the weak volume dependence while y/V is an upper estimate for 
this dependence. 



<50 2 /var(W) \ 
var(0)/W „ ft ~_ w (^ + 1 j (2 .8) 

where 80 2 is the variance of the observable without the reweighting factor, N cn f g the total number 
of configurations and x corr the autocorrelation time. We observe that the mass reweighting factor is 
distributed like a log-normal distribution p(W) ~ ^ exp j — ^ ln ^ ff2 ^ j ■ Then it is straightforward 
to show that the ensemble noise is given by 

with a 2 = var(lnW). 

In order to study the scaling of o 2 , we fix the stochastic noise to a small and volume indepen- 
dent value. From Fig. (Q) it follows directly that a 2 = k\ • Am 2 ■ V for some constant k\, while we 
observe that the Schur complement has only a weak V dependence. We appraise it with \/V . The 
V dependence of W emerges through a large correlation between the factors of the block operators 
and the Schur complement. In general mass reweighting in large volume is limited to small values 
of Am. 

3. One Flavor Mass Reweighting 

Nature motivates one flavor reweighting. There are many effects which depend only on the 
specific quark, like isospin splitting of the up- and down-quark. Also for corrections of a not exactly 
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tuned strange quark mass it is necessary to calculate the mass reweighting factor for one flavor. For 
that we introduce the integral 

1 r 1 Nhh + 

— = yDfo]Dfo^{-nten}^ — £ e -^-^ (3 .i) 

which is well defined only if Re(r]^MT]) > V T] / 0. The variance of the stochastic estimation is 
given by 

2_ 1 1 

°* ~ det(Mt+M- 1) ~ detMtM ( ' ' 

which is defined if X(M* +M) > 1. So it is possible to estimate the reweighting factor 1 / detM as 
long as the variance is defined. In general the scaling of one flavor mass reweighting is comparable 
with the two flavor case (see Fig. (Q)), but obviously there are some differences. The estimate 
is complex. In practice one can use this to improve the estimator: because of the 75 Hermiticity 
the expectation value is real and one can neglect the imaginary part. We found that this trick 
improves the estimation by a factor two compared to the square root trick In the case that a 
real eigenvalue becomes negative it is not possible by using mass interpolation to ensure that the 
integral is defined. Another issue is that the estimate being complex could have a negative sign. 
We do not detect such problem if we suppress the stochastic noise to a proper level which is easily 
achieved by increasing the number of mass interpolation steps. 

The proposed one flavor estimation has many advantages and should be used in future appli- 
cations. 



4. Conclusion 

In these proceedings we analyze the scaling behavior of mass reweighting by studying the 
stochastic and ensemble fluctuations with the methods mass interpolation and domain decompo- 
sition. We find that the stochastic fluctuations scale like Am 2 V / (N ■ Nhu) ■ By using domain de- 
composition with block sizes with I > 1 the fluctuations are reduced at least by a factor two. The 
ensemble fluctuations of the full operator scales like Am 2 V while for the Schur complement the 
volume dependence is weaker and compatible with Am 2 \/V. 

2 

Assuming that <7 2 0t (\nW) = a 2 (In IV) + the cost for the mass reweighting of the total op- 
erator can be deduced from the number of the original configurations needed, given by 

^ = N eff • exp {Am 2 -v(k l + -jg-) } + ^(Am 3 ) (4.1) 

for constants k{ and lc 2 which depends on the ensemble parameters. Here we use the definition 
of the number of effective configuration N e /f = (var(W) / (W) 2 + l) of and an analytic 
expansion of Eq. ( p3| ). For N e /f = 50 we get N cn f g /x corr = 1517(241509 (for the bigger volume) ) 
for Af • Nhit = 32 by fixing k 2 / {k\N],i t N) =0.11. If one consider a reweighting range of Am/ 2 the 
numbers change to N cn f g /x corr = 117(741) using the same numbers of inversions N -Nhu = 32. 
The total cost of evaluating the reweighting factor only scales with the volume V but through the 
V dependence of the ensemble fluctuations mass reweighting becomes rapidly inefficient for larger 
volumes. This limits the reweighting range in m. 
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Figure 3: The figures show the scaling of the stochastic fluctuations in the case of one flavor mass reweight- 
ing for one configuration of the smaller volume like in Fig. (|lj). We compare the one flavor reweighting 
with the root-trick [Q]. The root-trick is a biased estimator by using the square root of the two-flavor es- 
timation 1/detM = v //D[T7]D[T]t]exp{-77tMtM77} 1 /N hit ^{ exp{-T]J (M^M - 1)77,}. The right 
figure shows the estimated relative variance a 2 (W)/W 2 of the global Schur complement with 12 4 blocks 
against the number of mass interpolation steps N. The total number of inversions for each point is constant 
with N -Nijjt = 640. The plot shows the difference of the stochastic fluctuations for one flavor case Eq. (3.1) 



(red,triangle) and the root-trick (blue,diamonds). One can see that for all points the variance is finite, the 
one flavor integral exists. The right figure shows the constant fit to the 1 /AW/„- f -plateau for several operators 
(compare Fig. (|l|) 



To conclude there are many more details to discuss and to describe in a more general style in 
the framework of mass reweighting, like an analytic formula to characterize the stochastic estima- 



tion, estimation with zero crossings or a proof for the one flavor integral Eq. (3.1). We want to 
address this soon in an adequate way. 
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